Connecting via Winsock to STN 



Welcome to STN International I Enter x:x 
LOGINID : SSSptal649 jxm 
PASSWORD : 

TERMINAL (ENTER 1, 2, 3, OR ?):2 



Welcome to STN International 



NEWS 


1 






Web Page URLs for STN Seminar Schedule - N. America 


NEWS 


2 






"Ask CAS" for self-help around the clock 


NEWS 


3 


May 


12 


EXTEND option available in structure searching 


NEWS 


4 


May- 


12 


Polymer links for the POLYLINK command completed in REGISTRY 


NEWS 


5 


May 


27 


New UPM (Update Code Maximum) field for more efficient patent 
SDIs in CAplus 


NEWS 


6 


May 


27 


CAplus super roles and document types searchable in REGISTRY 


NEWS 


7 


Jun 


28 


Additional enzyme -catalyzed reactions added to CASREACT 


NEWS 


8 


Jun 


28 


ANTE, AQUALINE, BIOENG, CIVTLENG, ENVIROENG, MECHENG, 
and WATER from CSA now available on STN (R) 


NEWS 


9 


Jul 


12 


BEILSTEIN enhanced with new display and select options, 
resulting in a closer connection to BABS 


NEWS 


10 


Jul 


30 


BEILSTEIN on STN workshop to be held August 24 in conjunction 
with the 228th ACS National Meeting 


NEWS 


11 


AUG 


02 


IFIPAT/IFIUDB/IFICDB reloaded with new search and display 
fields 


NEWS 


12 


AUG 


02 


CAplus and CA patent records enhanced with European and Japan 
Patent Office Classifications 


NEWS 


13 


AUG 


02 


STN User Update to be held August 22 in conjunction with the 
22 8th ACS National Meeting 


NEWS 


14 


AUG 


02 


The Analysis Edition of STN Express with Discover! 
(Version 7.01 for Windows) now available 


NEWS 


15 


AUG 


04 


Pricing for the Save Answers for SciFinder Wizard within 
STN Express with Discover! will change September 1, 2 004 



NEWS EXPRESS JULY 30 CURRENT WINDOWS VERSION IS V7.01, CURRENT 
MACINTOSH VERSION IS V6.0c(ENG) AND V6.0Jc(JP), 
AND CURRENT DISCOVER FILE IS DATED 11 AUGUST 2 004 

NEWS HOURS STN Operating Hours Plus Help Desk Availability 

NEWS INTER General Internet Information 

NEWS LOGIN Welcome Banner and News Items 

NEWS PHONE Direct Dial and Telecommunication Network Access to STN 
NEWS WWW CAS World Wide Web Site (general information) 



Enter NEWS followed by the item number or name to see news on that 
specific topic. 

All use of STN is subject to the provisions of the STN Customer 
agreement. Please note that this agreement limits use to scientific 
research. Use for software development or design or implementation 
of commercial gateways or other similar uses is prohibited and may 
result in loss of user privileges and other penalties. 

************* stn Columbus *************** 
FILE 'HOME' ENTERED AT 13:04:37 ON 17 AUG 2004 



=> file medline biosis embase caplus 



SINCE FILE 
ENTRY 
0.21 



COST IN U.S. DOLLARS 
FULL ESTIMATED COST 

FILE ' MEDLINE 1 ENTERED AT 13:04:53 ON 17 AUG 2004 

FILE 'BIOSIS' ENTERED AT 13:04:53 ON 17 AUG 2004 
COPYRIGHT (C) 2 004 BIOLOGICAL ABSTRACTS INC.(R) 



FILE 'EMBASE' ENTERED AT 13:04:53 ON 17 AUG 2004 
COPYRIGHT (C) 2 004 Elsevier Inc. All rights reserved. 

FILE 'CAPLUS' ENTERED AT 13:04:53 ON 17 AUG 2004 

USE IS SUBJECT TO THE TERMS OF YOUR STN CUSTOMER AGREEMENT. 

PLEASE SEE "HELP USAGETERMS" FOR DETAILS. 

COPYRIGHT (C) 2 004 AMERICAN CHEMICAL SOCIETY (ACS) 

=> s luo ying/au 

LI 219 LUO YING/AU 

=> s mancebo helena/au 

L2 6 MANCEBO HELENA/AU 

=> s syk (s) ubp (s) protein 

L3 3 SYK (S) UBP (S) PROTEIN 

=> dup rem 13 

PROCESSING COMPLETED FOR L3 

L4 3 DUP REM L3 (0 DUPLICATES REMOVED) 

=> d 14 total ibib kwic 



TOTAL 
SESSION 
0.21 



L4 ANSWER 1 OF 3 
ACCESSION NUMBER: 
DOCUMENT NUMBER: 
TITLE : 



IT 



IT 



IT 



BIOSIS COPYRIGHT 2004 BIOLOGICAL ABSTRACTS INC. on STN 
2004 :237339 BIOSIS 
PREV200400237391 
SYK -UBP proteins, 
compositions and methods of use. 

Luo, Ying [Inventor, Reprint Author] ; Mancebo, Helena 
[Inventor] 
San Bruno, CA, USA 

ASSIGNEE: Rigel Pharmaceuticals, Inc. 
US 6709839 March 23, 2004 

Official Gazette of the United States Patent and Trademark 
Office Patents, (Mar 23 2004) Vol. 1280, No. 4. 
http : //www. uspto.gov/web/menu/patdata. html . e-f ile . 
ISSN: 0098-1133 (ISSN print) . 
Patent 
English 

Entered STN: 28 Apr 2004 
Last Updated on STN: 28 Apr 2004 
SYK-UBP proteins, compositions and methods 
of use. 

Major Concepts 

Methods and Techniques/ Pharmacology 
Diseases 

disease: disease-miscellaneous 

Disease (MeSH) 
Chemicals & Biochemicals 

SYK-UBP proteins: diagnostic-drug; 

nucleic acids; polypeptides 



AUTHOR (S) : 

CORPORATE SOURCE: 

PATENT INFORMATION 
SOURCE : 



DOCUMENT TYPE 
LANGUAGE : 
ENTRY DATE: 



TI 



L4 ANSWER 2 OF 3 
ACCESSION NUMBER: 
DOCUMENT NUMBER: 



CAPLUS COPYRIGHT 2 004 ACS on STN 
2003 : 376376 CAPLUS 
138 :364551 



TITLE : 



INVENTOR (S) : 

PATENT ASSIGNEE (S) 
SOURCE : 



DOCUMENT TYPE: 
LANGUAGE : 

FAMILY ACC. NUM. COUNT: 
PATENT INFORMATION: 



Novel human SYK kinase-associated cell cycle protein 
sequence and modulation of leukocyte activation, 
compositions and methods of use 

Wong, Brian; Fu, Chong Alan; Mancebo, Helena; Zhou, 

Xiulan X. Z . 

USA 

U.S. Pat. Appl . Publ., 88 pp., Cont . -in-part of U.S. 
Ser. No. 39,761. 



CODEN : 
Patent 
English 
3 



USXXCO 



PATENT NO. 



KIND DATE 



APPLICATION NO. DATE 



US 2003092605 
US 2003036107 
PRIORITY APPLN. INFO. 



Al 
Al 



20030515 
20030220 



AB 



for 



IT 



US 2002-112674 

US 2001-39761 

US 2001-280698P P 

US 2001-39761 A2 

US 1999-404967 A 

WO 2000-US26338 W 



20020329 
20011019 
20010329 
20011019 
19990924 
20000925 



The present invention provides compns . and methods for modulating 
leukocyte activation. The invention provides protein and cDNA 
sequences for novel human SYK kinase-associated cell cycle 
proteins SYK-UBP. Nucleic acids encoding 

proteins and proteins so encoded which are capable of modulating leukocyte 
activation are provided. Compns. and methods for the treatment of 
disorders related to leukocyte dysfunction or dysregulation are also 
provided. Prophylactics and methods for the prevention of such disorders 
are also provided. Also provided are compns. and methods for diagnostic 
and prognostic determination of such disorders. Further provided are assays 

the identification of bioactive agents capable of modulating leukocyte 
activation. 

331009-92-8D, Protein SYK-UBP (human isoform 
1) , Subfragments are claimed 

RL: BSU (Biological study, unclassified); PRP (Properties); BIOL 
(Biological study) 

(amino acid sequence; novel human SYK kinase-associated cell 

cycle protein sequence and modulation of leukocyte 

activation, compns. and methods of use) 



L4 



ANSWER 3 OF 3 CAPLUS COPYRIGHT 2 004 ACS on STN 



ACCESSION NUMBER: 
DOCUMENT NUMBER: 
TITLE : 



INVENTOR (S) : 
PATENT ASSIGNEE (S) : 
SOURCE : 

DOCUMENT TYPE: 
LANGUAGE : 

FAMILY ACC. NUM. COUNT: 
PATENT INFORMATION: 



2001:228922 CAPLUS 
134 :247997 

Novel SYK kinase-associated cell cycle 

proteins SYK-UBPs, 

compositions and methods of use 

Luo, Ying; Mancebo, Helena 

Rigel Pharmaceuticals, Inc., USA 

PCT Int. Appl., 63 pp. 

CODEN: PIXXD2 

Patent 

English 

3 



PATENT NO. 



KIND 



DATE 



WO 2001021654 
WO 2001021654 



20010329 
20020307 



APPLICATION NO. 
WO 2000-US26338 



DATE 
20000925 



A2 
A3 

W: AU, CA, JP, US 

RW: AT, BE, CH, CY, DE, DK, ES , FI, FR, GB, GR, IE, IT, LU, MC, NL, 



PT, SE 

US 6709839 Bl 20040323 US 1999-404967 19990924 

AU 2000078328 A5 20010424 AU 2000-78328 20000925 

EP 1218505 A2 20020703 EP 2000-968406 20000925 

R: AT, BE, CH, DE, DK, ES, FR, GB, GR, IT, LI, LU, NL, SE, MC, PT, 
IE, FI, CY 

JP 2003512820 T2 20030408 JP 2001-525227 20000925 

US 2003036107 Al 20030220 US 2001-39761 20011019 

PRIORITY APPLN. INFO.: US 1999-404967 A2 19990924 

WO 2000-US26338 W 20000925 

US 2001-280698P P 20010329 

TI Novel SYK kinase-associated cell cycle proteins 
SYK-UBPs, compositions and methods of use 

AB The invention provides protein and cDNA sequences for novel 
human SYK kinase-associated cell cycle proteins 
SYK-UBPs, which have an effect on or are related to the 

cell cycle. Also provided herein are vectors and host cells comprising 
those nucleic acid sequences, chimeric polypeptide mols. comprising the 
polypeptides of the present invention fused to heterologous polypeptide 
sequences, antibodies which bind to the polypeptides of the present 
invention and to methods for producing the polypeptides of the present 
invention. Further provided by the present invention are methods for 
identifying novel compns . which mediate cell cycle bioactivity, and the 
use of such compns. in diagnosis and treatment of disease. 
IT Proteins, specific or class 

RL: BPN (Biosynthetic preparation) ; BPR (Biological process) ; BSU 
(Biological study, unclassified) ; PRP (Properties) ; THU (Therapeutic use) 
BIOL (Biological study) ; PREP (Preparation) ; PROC (Process) ; USES (Uses) 
(SYK-UBP, of human; novel SYK 

kinase-associated cell cycle proteins SYK-UBPs 
, compns. and methods of use) 
IT cDNA sequences 

(for protein SYK-UBP, of human; novel 
SYK kinase-associated cell cycle proteins SYK- 
UBPs, compns. and methods of use) 
IT Gene , animal 

RL: BSU (Biological study, unclassified); PRP (Properties); THU 
(Therapeutic use) ; BIOL (Biological study) ; USES (Uses) 
(for protein SYK-UBP, of human; novel 
SYK kinase-associated cell cycle proteins SYK- 
UBPs, compns. and methods of use) 
IT Diagnosis 

(mol.; novel SYK kinase-associated cell cycle proteins 
SYK-UBPs, compns. and methods of use) 
IT Antibodies 

RL: THU (Therapeutic use); BIOL (Biological study); USES (Uses) 
(monoclonal, to protein SYK-UBP; novel 
SYK kinase-associated cell cycle proteins SYK- 
UBPs, compns. and methods of use) 
IT Cell cycle 

Drug screening 

Gene therapy 

Molecular cloning 

Nucleic acid hybridization 

(novel SYK kinase-associated cell cycle proteins 
SYK-UBPs, compns. and methods of use) 
IT Primers (nucleic acid) 
Probes (nucleic acid) 

RL: ARG (Analytical reagent use); ANST (Analytical study); USES (Uses) 
(novel SYK kinase-associated cell cycle proteins 
SYK-UBPs, compns. and methods of use) 
IT Protein sequences 

(of protein SYK-UBP, of human; novel 

SYK kinase-associated cell cycle proteins SYK- 



UBPs, compns. and methods of use) 
IT Antibodies 

RL: THU (Therapeutic use) ; BIOL (Biological study) ; USES (Uses) 
(to protein SYK-UBP; novel SYK 

kinase-associated cell cycle proteins SYK-UBPs 
, compns. and methods of use) 
IT Protein motifs 

(ubiquitin-associated binding domain in protein SYK- 
UBP; novel SYK kinase-associated cell cycle 
proteins SYK-UBPs, compns. and methods of 
use) 

IT Protein motifs 

(ubiquitin-specif ic protease domain in protein SYK- 
UBP; novel SYK kinase-associated cell cycle 
proteins SYK-UBPs, compns. and methods of 
use) 

IT 331009-92-8P, Protein SYK-UBP (human isoform 
1) 331009-93-9P, Protein SYK-UBP (human 
isoform 2) 

RL: BPN (Biosynthetic preparation); BPR (Biological process); BSU 
(Biological study, unclassified) ; PRP (Properties) ; THU (Therapeutic use) ; 
BIOL (Biological study) ; PREP (Preparation) ; PROC (Process) ; USES (Uses) 

(amino acid sequence; novel SYK kinase-associated cell cycle 

proteins SYK-UBPs, compns. and methods of 

use) 

IT 60267-61-0, Ubiquitin 138674-26-7, SYK kinase 

RL: BPR (Biological process) ; BSU (Biological study, unclassified) ; BIOL 
(Biological study) ; PROC (Process) 

(cell cycle protein SYK-UBP interacting 
with; novel SYK kinase-associated cell cycle proteins 
SYK-UBPs, compns. and methods of use) 
IT 331009-94-0 331009-95-1 

RL: BSU (Biological study, unclassified); PRP (Properties); THU 
(Therapeutic use) ; BIOL (Biological study) ; USES (Uses) 

(nucleotide sequence; novel SYK kinase-associated cell cycle 
proteins SYK-UBPs, compns. and methods of 
use) 

IT 331011-23-5 

RL: PRP (Properties) 

(unclaimed protein sequence; novel SYK 
kinase-associated cell cycle proteins SYK-UBPs 
, compns. and methods of use) 



L Number 


Hits 


Search Text 


DB 


Time stamp 


1 


27 


luo-ying.in. 


USPAT; 
US-PGPUB; 
EPO; JPO; 
DERWENT 


2004/08/17 11:55 


2 


4 


mancebo-helena.in. 


USPAT; 
US-PGPUB; 
EPO; JPO; 

FM7D U/CMT 


2004/08/17 11:56 


3 


3 


syk-ubp same protein 


USPAT; 
US-PGPUB; 
EPO; JPO; 
DERWENT 


2004/08/17 11:56 



Search History 8/1 7/04 1 2:36: 1 6 PM Page 1 
C:\APPS\EAST\Workspaces\10088960.wsp 



10088960 Results 



SEQ ID NO: 2 

% 

Result Query- 
No. Score Match Length DB ID Description 



1 


5473 


100 


.0 


1055 


22 


AAB74491 


Human SYK kinase b 


2 


5460 


99 


.7 


1055 


22 


AAB31546 


A human ubiquitin 


3 


5460 


99 


.7 


1055 


22 


AAB31552 


A human ubiquitin 


4 


5453 


99 


.6 


1075 


22 


AAB74492 


Human SYK kinase b 


5 


5430 


99 


.2 


1087 


22 


AAB31550 


A human ubiquitin 


6 


5430 


99 


.2 


1087 


22 


AAB31556 


A human ubiquitin 


7 


4308 


78 


.7 


834 


22 


AAB74493 


Human SYK kinase b 


8 


3741 


68 


.3 


720 


20 


AAY45343 


Human secreted pro 


9 


344 


6 


.3 


71 


21 


AAG03009 


Human secreted pro 


10 


318.5 


5 


.8 


1264 


14 


AAR36730 


Ubiquitin- specific 


11 


224 


4 


.1 


2547 


19 


AAW81511 


Drosophila fat fac 


12 


215 


3 


.9 


912 


22 


AAB75607 


Human cancer assoc 


13 


214 .5 


3 


.9 


2954 


20 


AAY01632 


Amino acid sequenc 


14 


213.5 


3 


.9 


1517 


21 


AAG38482 


Arabidopsis thalia 


15 


213.5 


3 


.9 


1543 


21 


7AAG38481 


Arabidopsis thalia 



RESULT 8 
AAY45343 

ID AAY4 5343 standard; Protein; 720 AA. 
XX 

AC AAY45343; 
XX 

DT 23-NOV-1999 (first entry) 
XX 

DE Human secreted protein fragment encoded from gene 23 . 
XX 

KW Secreted protein; human; gene therapy; diagnosis; treatment; cancer; 

KW protein therapy; tumor; neurodegenerative disorder; blood disorder; AIDS; 

KW developmental abnormality; leukemia; immune system; autoimmune disease; 

KW hepatic disease; renal disease; inflammation; allergy; schizophrenia; 

KW Alzheimer's disease; cognitive disorder; arthritis; infection; psoriasis; 

KW transplant rejection; diabetes; asthma; sepsis; acne; metabolic disorder; 

KW cardiovascular disorder; food additive; preservative. 

XX 

OS Homo sapiens. 
XX 

PN W09946289-A1 . 
XX 

PD 16-SEP-1999. 
XX 

PF ll-MAR-1999; 99WO-US05721 . 
XX 

PR 12-MAR-1998; 98US- 0077686 . 

PR 12-MAR-1998; 98US-0077687 . 

PR 12-MAR-1998; 98US-0077696 . 

PR 12-MAR-1998; 98US-0077714 . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Ruben SM, Ferrie AM, Rosen CA, Florence C, Young PE, Yu G, Ni J; 
XX 

DR WPI; 1999-551363/46. 

DR N-PSDB; 7AAZ27255. 
XX 

PT New isolated human genes, useful for diagnosis and treatment of, e.g. 

PT cancers 

XX 

PS Disclosure; 214-216; 306pp; English. 
XX 

CC This invention describes novel human genes and the secreted proteins 

CC they encode. The polynucleotides and their corresponding secreted 

CC polypeptides are useful for preventing, treating or ameliorating medical 



CC conditions e.g. by protein or gene therapy. Also pathological conditions 

CC can be diagnosed by determining the amount of the new polypeptides in a 

CC sample or by determining the presence of mutations in the new 

CC polynucleotides. Specific uses are described for each of the 

CC polynucleotides of the invention, based on which tissues they are most 

CC highly expressed in, and include developing products for the diagnosis or 

CC treatment of cancer, tumors, neurodegenerative disorders, developmental 

CC abnormalities, blood disorders, leukemias, diseases of the immune system, 

CC autoimmune diseases, hepatic and renal disease, inflammation, allergies, 

CC Alzheimer's and cognitive disorders, schizophrenia, arthritis, 

CC infections, AIDS, transplant rejection, diabetes, asthma, sepsis, acne, 

CC psoriasis, cardiovascular disorders, and metabolic disorders. The 

CC polypeptides or polynucleotides can also be used as food additives or 

CC preservatives. The polypeptides are also useful for identifying their 

CC binding partners. AAY45290-Y45389 represent human secreted protein 

CC fragments described in the method of the invention. 

XX 

SQ Sequence 720 AA; 



Query Match 68.3%; Score 3741; DB 20; Length 720; 

Best Local Similarity 99.7%; Pred. No. 2.5e-2 77; 



Matches 


718; Conservative 0; Mismatches 2; Indels 0; Gaps 


Qy 


336 


CLEAAMIEGEIESLHSENSGKSGQEHWFTGLPPVLTFXLSRFEFNQALGRPEKIHNKLEF 

IIIIIIIIIIIIIIIIIIIIIIIMIIII lllllll IIIIIIIIIIMIIIIIIIIII 
cleaamiegeieslhsensgksgqehwftelppvltfelsrfefnqalgrpekihnklef 


395 


Db 


1 


60 


Qy 


396 


PQVLYLDRYMHRNREITRIKREEIKRLKDYLTVLQQRLERYLSYGSGPKRFPLVDVLQYA 

lllllll lllllll IMIIIMI III lllllll IMIMIII MUM II IIIIMM II 

pqvlyldrymhrnreitrikreeikrlkdyltvlqqrlerylsygsgpkrfplvdvlqya 


455 


Db 


61 


120 


Qy 


456 


LEFASSKPVCTSPVTJDIDASSPPSGSIPSQTLPSTTEQQGALSSELPSTSPSSVAAISSR 


515 


Db 


121 


IMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIIIM 
lef asskpvctspvddidassppsgsipsqtlpstteqqgalsselpstspssvaaissr 


180 


Qy 


516 


SVIHKPFTQSRIPPDLPMHPAPRHITEEELSVLESCLHRWRTEIENDTRDLQESISRIHR 

IMIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIII 


575 


Db 


181 


svihkpf tqsrippdlpmhpaprhiteeelsvlesclhrwrteiendtrdlqesisrihr 


240 


Qy 


576 


TIELMYSDKSMIQVPYRLHAVLVHEGQANAGHYWAYIFDHRESRWMKYNDIAVTKSSWEE 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIII 
tielmysdksmiqvpyrlhavlvhegqanaghywayifdhresrwmkyndiavtksswee 


635 


Db 


241 


300 


Qy 


636 


LVRDSFGGYRNASAYCLMYINDKAQFLIQEEFNKETGQPLVGIETLPPDLRDFVEEDNQR 

MINN MIMM 1 II 1! INI: ! ! : : Mill 1 MIMMMMMI M II 1 1 1 1 !MI 

1 vrds f ggy rnasayc lmy indkaqf 1 iqee f nke tgqpl vg ie t lppdl rdf veednqr 


695 


Db 


301 


360 


Qy 


696 


FEKELEEWDAQLAQKALQEKLLASQKLRESETSVTTAQAAGDPEYLEQPSRSDFSKHLKE 

IIIIIIIIIIIIIIMIMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

fekeleewdaqlaqkalqekllasqklresetsvttaqaagdpeyleqpsrsdf skhlke 


755 


Db 


361 


420 


Qy 


756 


ETIQIITKASHEHEDKSPETVLQSAIKLEYARLVKLAQEDTPPETDYRLHHVWYFIQNQ 

IMIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIMIIIIIIMIIMIIIIIIII 
etiqiitkashehedkspetvlqsaikleyarlvklaqedtppetdyrlhhvwyf iqnq 


815 


Db 


421 


480 


Qy 


816 


APKKIIEKTLLEQFGDRNLSFDERCHNIMKVAQAKLEMIKPEEVNLEEYEEWHQDYRKFR 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIM 
apkkiiektlleqfgdrnlsfderchnimkvaqaklemikpeevnleeyeewhqdyrkf r 


875 


Db 


481 


540 


Qy 


876 


ETTMYLIIGLENFQRESYIDSLLFLICAYQNNKELLSKGLYRGHDEELISHYRRECLLKL 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIII 
ettmyliiglenfqresyidsllf licayqnnkellskglyrghdeelishyrrecllkl 


935 


Db 


541 


600 


Qy 


936 


NEQAAELFESGEDREVNNGLIIMNEFIVPFLPLLLVDEMEEKDILAVEDMRNRWCSYLGQ 

IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
neqaae 1 f e sgedr evnngl i imne f i vpf lpl 1 1 vdemeekdi lavedmrnrwcsy Igq 


995 


Db 


601 


660 


Qy 


996 


EMEPHLQEKLTDFLPKLLDCSMEIKSFHEPPKLPSYSTHELCERFARIMLSLSRTPADGR 

IIIIIIIIIIIMIIIIIIIIIIMIIIIMIIIIIIIIIIIIIMIIMIIIIIIIMI 

emephlqekltdf lpklldcsmeiksfheppklpsysthelcerf arimlslsrtpadgr 


1055 


Db 


661 


720 



Result 
No. 


Score 


Query 

Match Length DB 


ID 








Description 


1 


327.5 


6 . 0 


1264 


1 


US- 


07 


-789 


-915A-6 


Sequence 6, Appli 


2 


327.5 


6.0 


1264 


1 


us- 


08 


-005 


-002C-6 


Sequence 6, Appli 


3 


327.5 


6 . 0 


1264 


1 


us- 


08 


-487 


-203A-6 


Sequence 6, Appli 


4 


224 


4.1 


2547 


3 


us- 


09 


-058 


-489-35 


Sequence 35, Appl 


5 


210 


3.8 


2555 


3 


us- 


09 


-058 


-489-36 


Sequence 36, Appl 


6 


186.5 


3.4 


1786 


4 


us- 


08 


-973 


-462-8 


Sequence 8, Appli 


7 


178.5 


3.3 


1164 


4 


us- 


08 


-923 


-992A-10 


Sequence 10, Appl 


8 


175 


3.2 


1312 


2 


us- 


08 


-687 


-080-51 


Sequence 51, Appl 


9 


174 


3.2 


1312 


2 


us- 


08 


-592 


-126-148 


Sequence 148, App 


10 


169 


3.1 


809 


1 


us- 


07 


-789 


-915A-4 


Sequence 4, Appli 


11 


169 


3.1 


809 


1 


us- 


08 


-005 


-002C-4 


Sequence 4, Appli 


Result 




Query- 
















No. 


Score 


Match Length DB 


ID 








Description 



1 


4 93 . 5 


9.0 


1302 


2 


1 Z O Z J fa 


t 7-r\ s\ t* V* a +- A no 1 t"*» >~ /-v t- 

iiypouneu luai piroue 


Z 


iZ O . D 


5.9 


1272 


2 




UJJXCJU X U XII bpcLl i-XL 


3 




5.7 


1108 


2 


TA 1 1 OQ 
141100 


prOJJaDXc ILUXLJUX L. XII 






5.7 


1221 


2 


T3 0529 


nhi mn t**in r*A TV^r^v\/l 


5 


287 


5.2 


1130 


2 


T23104 


hypothetical prote 


6 


287 


5.2 


1133 


2 


T23103 


hypothetical prote 


7 


268.5 


4.9 


3187 


2 


JC5837 


364K Golgi complex 


8 


263.5 


4.8 


1230 


2 


T19899 


hypothetical prote 


9 


261.5 


4.8 


1230 


2 


S53974 


hypothetical prote 


10 


239.5 


4.4 


1430 


2 


T21910 


hypothetical prote 


11 


232 


4.2 


1292 


2 


T24559 


hypothetical prote 


Result 




Query 










No. 


Score 


Match Length DB 


ID 


Description 


1 


3836 


70.1 


737 


4 


Q9H9W1 


Q9h9wl homo sapien 


2 


2741 


50.1 


523 


4 


Q9HA22 


Q9ha22 homo sapien 


3 


1895 


34 .6 


757 


4 


Q9P213 


Q9p213 homo sapien 


4 


301 


5.5 


1124 


10 


Q9SF08 


Q9sf08 arabidopsis 


5 


296 


5.4 


1248 


5 


Q9VRP1 


Q9vrpl drosophila 


6 


294 


5.4 


1126 


10 


Q9FG10 


Q9fgl0 arabidopsis 


7 


293 .5 


5.4 


1116 


10 


Q9FPT1 


Q9fptl arabidopsis 


8 


288 .5 


5.3 


599 


10 


Q9FU99 


Q9fu99 arabidopsis 


9 


287 


5.2 


1130 


5 


045624 


045624 caenorhabdi 


10 


287 


5.2 


1133 


5 


045623 


045623 caenorhabdi 


11 


283.5 


5.2 


1141 


3 


Q9P3U0 


Q9p3u0 schizosacch 


12 


268.5 


4.9 


3187 


11 


Q63714 


Q63714 rattus norv 


13 


264 .5 


4.8 


931 


4 


Q9HCH8 


Q9hch8 homo sapien 


14 


264 .5 


4.8 


1129 


3 


Q9UTT1 


Q9uttl schizosacch 


15 


263.5 


4.8 


1230 


5 


Q93371 


Q93371 caenorhabdi 



SEQ ID NO: 4 



Result Query 

No. Score Match Length DB ID Description 



1 


5579 


100. 


.0 


1075 


22 


AAB74492 


Human SYK kinase b 


2 


5453 


97, 


.7 


1055 


22 


AAB74491 


Human SYK kinase b 


3 


5440 


97. 


.5 


1055 


22 


AAB31546 


A human ubiquitin 


4 


5440 


97. 


.5 


1055 


22 


AAB31552 


A human ubiquitin 


5 


5410 


96, 


.9 


1087 


22 


AAB31550 


A human ubiquitin 


6 


5410 


96 , 


.9 


1087 


22 


AAB31556 


A human ubiquitin 


7 


4308 


77 , 


.2 


834 


22 


AAB74493 


Human SYK kinase b 


8 


3721 


66. 


.7 


720 


20 


AAY4 5343 


Human secreted pro 


9 


344 


6 . 


.2 


71 


21 


AAG03 009 


Human secreted pro 


10 


318.5 


5. 


.7 


1264 


14 


AAR36730 


Ubiquitin- specific 


11 


224 


4 . 


.0 


2547 


19 


AAW81511 


Drosophila fat fac 



RESULT 8 
AAY4 5343 

ID AAY45343 standard; Protein; 720 AA. 
XX 

AC AAY45343; 
XX 

DT 23-NOV-1999 {first entry) 
XX 

DE Human secreted protein fragment encoded from gene 23. 
XX 

KW Secreted protein; human; gene therapy; diagnosis; treatment; cancer; 

KW protein therapy; tumor; neurodegenerative disorder; blood disorder; AIDS; 

KW developmental abnormality; leukemia; immune system; autoimmune disease; 

KW hepatic disease; renal disease; inflammation; allergy; schizophrenia; 

KW Alzheimer's disease; cognitive disorder; arthritis; infection; psoriasis; 

KW transplant rejection; diabetes; asthma; sepsis; acne; metabolic disorder; 

KW cardiovascular disorder; food additive; preservative. 

XX 

OS Homo sapiens. 
XX 

PN W09946289-A1. 
XX 

PD 16-SEP-1999. 
XX 

PF ll-MAR-1999; 99WO-US05721 . 
XX 

PR 12-MAR-1998; 98US - 0077686 . 

PR 12-MAR-1998; 98US-0077687 . 

PR 12-MAR-1998; 98US-0077696 . 

PR 12-MAR-1998; 98US- 0077714 . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Ruben SM, Ferrie AM, Rosen CA, Florence C, Young PE, Yu G, Ni J; 
XX 

DR WPI; 1999-551363/46. 

DR N-PSDB; AAZ27255 . 
XX 

PT New isolated human genes, useful for diagnosis and treatment of, e.g. 

PT cancers 

XX 

PS Disclosure; 214-216; 306pp; English. 
XX 

CC This invention describes novel human genes and the secreted proteins 

CC they encode. The polynucleotides and their corresponding secreted 

CC polypeptides are useful for preventing, treating or ameliorating medical 

CC conditions e.g. by protein or gene therapy. Also pathological conditions 

CC can be diagnosed by determining the amount of the new polypeptides in a 

CC sample or by determining the presence of mutations in the new 

CC polynucleotides. Specific uses are described for each of the 

CC polynucleotides of the invention, based on which tissues they are most 

CC highly expressed in, and include developing products for the diagnosis or 

CC treatment of cancer, tumors, neurodegenerative disorders, developmental 

CC abnormalities, blood disorders, leukemias, diseases of the immune system, 

CC autoimmune diseases, hepatic and renal disease, inflammation, allergies, 

CC Alzheimer's and cognitive disorders, schizophrenia, arthritis, 

CC infections, AIDS, transplant rejection, diabetes, asthma, sepsis, acne, 

CC psoriasis, cardiovascular disorders, and metabolic disorders. The 

CC polypeptides or polynucleotides can also be used as food additives or 

CC preservatives. The polypeptides are also useful for identifying their 

CC binding partners. AAY4 5290-Y4 53 8 9 represent human secreted protein 

CC fragments described in the method of the invention. 

XX 

SQ Sequence 720 AA; 



Query Match 66.7%; Score 3721; DB 20 

Best Local Similarity 97.0%; Pred. No. 1.9e-278 
Matches 718; Conservative 0; Mismatches 2 



Length 720; 

Indels 20; Gaps 1; 



Qy 336 CLEAAMIEGEIESLHSENSGKSGQEHWFTGLPPVLTFXLSRFEFNQALGRPEKIHNKLEF 395 

iiiiiiiiiiiiiiiiiiiimmiii iiinii iiiiiiiiiiiiMiiiiini 

Db 1 cleaamiegeieslhsensgksgqehwftelppvltfelsrfefnqalgrpekihnklef 60 

Qy 396 PQVL YLDRYMHRNRE ITRIKREEI KRLKD YLT VLQQRLER YL S YG S G P KRF PL VD VLQ YA 455 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 pqvlyldrymhrnreitrikreeikrlkdyltvlqqrlerylsygsgpkrfplvdvlqya 120 

Qy 456 LEFASSKPVCTSPVDDIDASSPPSGSIPSQTLPSTTEQQGALSSELPSTSPSSVAAISSR 515 

IIMIIIIIIIIIIIIMIIIIIMMMMIIIIIIIIIIIIIIIMIIIIIIIIIIII 

Db 121 lefasskpvctspvddidassppsgsipsqtlpstteqqgalsselpstspssvaaissr 180 
Qy 516 SVIHKPFTQSRIPPDLPMHPAPRHITEEELSVLESCLHRWRTEIENDTRDLQESISRIHR 575 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1! 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 181 svihkpftqsrippdlpmhpaprhiteeelsvlesclhrwrteiendtrdlqesisrihr 24 0 

Qy 576 TIE LMYSDKSM I QVPYRLHAVLVHEGQANAGHYWAY I FDHRE S RWMKYND I AVTKS S WE E 635 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
Db 241 tielmysdksmiqvpyrlhavlvhegqanaghywayifdhresrwmkyndiavtksswee 300 

Qy 636 L VRDS FGG YRNAS AYCLM Y I NDKAQ F L I Q E E FNKETGQ P LVG IETLPPDLRD F VEEDNQR 695 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I 
Db 301 lvrdsfggyrnasayclmyindkaqf liqeef nketgqplvgietlppdlrdfveednqr 360 

Qy 696 FEKELEEWDAQLAQKALQEKLLASQKLRESETSVTTAQAAGDPEYLEQPSRSDFSKHLKE 75 5 

I I II 1 I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 361 fekeleewdaqlaqkalqekllasqklresetsvttaqaagdpeyleqpsrsdf skhlke 420 

Qy 756 ETIQIITKASHEHEDKSPETVLQSAIKLEYARLVKLAQEDTPPETDYRLHHWVYFIQNQ 815 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 421 etiqiitkashehedkspetvlqsaikleyarlvklaqedtppetdyrlhhwvyf iqnq 480 

Qy 816 APKKIIEKTLLEQFGDRNLSFDERCHNIMKVAQAKLEMIKPEEWLEEYEEWHQDYRKFR 875 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I II I I I I I I I I I I I I I I I I I I 
Db 481 apkkiiektlleqfgdrnlsfderchnimkvaqaklemikpeevnleeyeewhqdyrkfr 540 

Qy 876 ETTMYLIIGLENFQRESYIDSLLFLICAYQNNKELLSKGLYRGHDEELISHYRRECLLIL 935 

1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 

Db 541 ettmyliiglenfqresyidsllf licayqnnkellskglyrghdeelishyrrecll- - 598 
Qy 936 NLKRKQKPILFFFLHCIKKLNEQAAELFESGEDREVNNGLIIMNEFIVPFLPLLLVDEME 995 

III Mill lilllllllllll II MINI II MINI Mill 

Db 599 klneqaaelfesgedrevnngliimnef ivpf lplllvdeme 64 0 

Qy 996 EKDILAVEDMRNRWCSYLGQEMEPHLQEKLTDFLPKLLDCSMEIKSFHEPPKLPSYSTHE 1055 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 641 ekdilavedmrnrwcsylgqemephlqekltdf lpklldcsmeiksfheppklpsysthe 700 

Qy 1056 LCERFARIMLSLSRTPADGR 1075 

MINIUM MINIMI 
Db 701 lcerfarimlslsrtpadgr 720 

RESULT 10 
AAR36730 

ID AAR36730 standard; Protein; 1264 AA. 
XX 

AC AAR36730; 
XX 

DT 26-AUG-1993 (first entry) 
XX 

DE Ubiquitin- specif ic protease, UBP2 . 
XX 

KW Fusion protein; ubiquitin-methionine-beta-galactosidase; UBP2 ; UBP3 ; 
KW expression vector; ubiquitin-specif ic; protease; UBP1; in vitro; 
KW transformation; Ub -met -bet a- Gal; in vivo. 
XX 

OS Synthetic. 
XX 

PN WO9309235-A. 



XX 

PD 13-MAY-1993. 
XX 

PF 06-NOV-1992; 92WO-US09562 . 
XX 

PR 08-NOV-1991; 91US-0789915 . 
XX 

PA (MASI ) MASSACHUSETTS INST TECHNOLOGY. 
XX 

PI Baker RT, Tobias JW, Varshavsky A; 
XX 

DR WPI; 1993-167700/20. 

DR N-PSDB; AAQ41289. 
XX 

PT Ubiquitin-specific protease (s) - specifically cleave fusion 

PT proteins regardless of size, at C-terminus of ubiquitin portion 

XX 

PS Claim 6; Page 47-54; 78pp; English. 
XX 

CC The sequences given in AAR36729-31 represent the ubiquitin-specific 

CC proteases UBP1, UBP2 , and UBP3 . These proteases specifically cleave 

CC the 120 kD fusion protein ubiquitin-methionine-beta-galactosidase 

CC (Ub-met-beta-Gal) (see also AAR36728) . The proteases UBP1 and UBP2 

CC demonstrate activity both in vivo and in vitro, whereas UBP3 is only 

CC active in vivo. These proteases have been shown to specifically 

CC cleave Ub-met-beta-Gal at the C-terminus of the ubiquitin moiety. 

CC The proteases can be used to deubiquinate fusion proteins in vivo, 

CC therefore prokaryotic cells having an expression vector one of these 

CC proteases can be further transformed with an expression vector 

CC encoding a ubiquitin fusion protein. Such cells will then produce 

CC a deubiquinated product having a predetermined N- terminal amino acid 

CC residue. 
XX 

SQ Sequence 1264 AA; 



Query Match 5.7%; Score 318.5; DB 14; Length 1264; 

Best Local Similarity 19.6%; Pred. No. 1.8e-15; 

Matches 170; Conservative 121; Mismatches 240; Indels 337; Gaps 

Qy 4 EQNVLQQSAAQKHQQTF LNQLREITGIN DTQ I LQQALKD SNGNLELAV 51 

:|::: I I : I :::| I I - \ I = Ih h 

Db 502 kqdiigqealenallmfkinpkecniselneatllsiykyetsnksqvtsnhltnlknal 561 

Qy 52 AFLTAKNAKTPQQEETTYYQTALPGNDRYISVGSQADTNVIDLTGDD 98 

I || : :: :| | | :: II 11 = t = 

Db 562 rll akytksdklkf yvdhep yralsqaydtlsidesvdediiktaysvkind 613 



Qy 99 KDDLQR TIALS LAESNRAFRET 120 

II I MM I = I II 

Db 614 spglkldcdralytiaiskrsldlfnflteecpqfsnyygpekllqvnenasdetilkif 673 

Qy 121 GITDEEQAI SRVLEASIAENKACL KRTPTEVWRDSRNPY 159 

: : :| : : : : M | : Ml 
Db 674 kqkwfdenvyepdqflilraaltkisiernstlitnflltgtidpnslppenw 726 

Qy 160 DRKRQDKAPVGLKNVGNTCWFSAVIQSLFNLLEFRRLVLNY-KPPSNAQD 208 

I h 1 = 1111= = = = = 1 I" II II I I I I 

Db 727 ptginnigntcylnsllqyyf siaplrryvleyqktvenfndhlsnsghirr 778 

Qy 209 LPRNQKEHRNLPFMRELRYLFALLVGTKRKYVDPSRAV 246 

: | : | |>> |> :|| || =| h = I Ih : 
Db 779 iggrqisrgeve-rsiqf iyqlrnlf yamvhtrercvtpskelaylaf apsnvevef eve 837 

Qy 247 EILKDAF KSNDSQQQD 262 

I III I hi = 

Db 838 gnkwdqtgvlsdskkettddafttkikdtslidlemedglngdvgtdanrkknesndae 897 

Qy 263 VSE FTHKLLDWLEDAFQMKAEEET DEEK 290 

III | ||:| :| =:: |h 

Db 898 vsenedttgltsptrvakissdqlenalemgrqqdvtecignvlfqiesgsepirydedn 957 



Qy 291 PKNPMV-ELFYG RFLAVGVLEGKKFENTEMFGQYPLQVNG 32 9 

= =1 =1111 |||:: : I :| 
Db 958 eqydlvkqlfygttkqsivplsatnkvrtkverflslli nigdhp 1002 

Qy 330 FKDLHECLEAAMIEGEIESLHSENSGKSGQEHWFTGLPPVLTFXLSRFEFNQALGRPEKI 389 

lh== =: : I I I I = | | :| : | ::: | | 

Db 1003 -kdiydafdsyfkd eyltmeeygdvirtvavttfptilqvqiqrvyydrerlmpf ks 1058 

Qy 390 HNKLEFPQVLYLDRYMHRNRE ITRI KREE I KRLKDYLTVLQQRLERYLS Y - GSGPKRFPL 448 

I I =1 = 1-111 = h:| = :| I I- I II II I 
Db 1059 ieplpfkeviymdryadtenplllakkketeemkqklkvmknrqrellsrddsgltr 1115 

Qy 449 VDVLQYALEFASSKPVCTSPVDDIDASSPPSGSIPSQTLPSTTEQQGALSSELPSTSPSS 508 

I ::: | : :|: | = | :::: | :: 

Db 1116 kdaf lesikllesdtikktpl-kiea andviktlrnn 1151 

Qy 509 VAAISSRSVIHKPFTQSRIPPDLPMHPAPRHITEEELSVLESCLHRWRTEIENDTRDLQE 568 

II =11 :: || hi 

Db 1152 vqni dnel raklyndinslee 1171 

Qy 569 S I SRIHRTI ELMYSDKSMIQVPYRLHAVLVHEGQANAGHYWAYI FD -HRESRWMKYNDI A 627 

II =1 = II =1 =1 : III II I :| I Mil 

Db 1172 kish qfdd--f keygyslf svf ihrgeasyrhywiyikdrnrngiwrkyndet 1222 

Qy 62 8 VTKSSWEELVRDSFGGYRNASAYCLMYI 655 

II' H h I hh 

Db 1223 isevqeeevf - -nf negntatpyf lvyv 1248 



Result 




Query 












No. 


Score 


Match Length DB 


ID 




Description 


1 


327.5 


5.9 


1264 


1 


US-07-789 


-915A-6 


Sequence 6, Appli 


2 


327.5 


5.9 


1264 


1 


US-08-005 


-002C-6 


Sequence 6, Appli 


3 


327.5 


5.9 


1264 


1 


US-08-487 


-203A-6 


Sequence 6, Appli 


4 


224 


4.0 


2547 


3 


US-09-058 


-489-35 


Sequence 35, Appl 


5 


210 


3.8 


2555 


3 


US-09-058 


-489-36 


Sequence 36, Appl 


6 


191.5 


3.4 


1786 


4 


US-08-973 


-462-8 


Sequence 8, Appli 


7 


174.5 


3.1 


1164 


4 


US-08-923 


-992A-10 


Sequence 10, Appl 


8 


172 


3.1 


1312 


2 


US-08-687 


-080-51 


Sequence 51, Appl 


9 


171.5 


3.1 


3248 


1 


US-08-353 


-700-1 


Sequence 1, Appli 


10 


171.5 


3.1 


3248 


5 


PCT-US95-: 


L6216-1 


Sequence 1, Appli 


11 


171 


3.1 


1312 


2 


US-08-592 


-126-148 


Sequence 14 8, App 


Result 




Query 












No. 


Score 


Match 


Length DB 


ID 




Description 


1 


485.5 


8.7 


1302 


2 


T23236 




hypothetical prote 


2 


323.5 


5.8 


1272 


2 


S60999 




ubiquitin- specific 


3 


316.5 


5.7 


1108 


2 


T41188 




probable ubiquitin 


4 


310 


5.6 


1221 


2 


T30529 




ubiquitin carboxyl 


5 


287 


5.1 


1130 


2 


T23104 




hypothetical prote 


6 


287 


5.1 


1133 


2 


T23103 




hypothetical prote 


7 


266.5 


4.8 


1230 


2 


T19899 




hypothetical prote 


8 


263 


4.7 


3187 


2 


JC5837 




364K Golgi complex 


9 


253.5 


4.5 


1230 


2 


S53974 




hypothetical prote 



RESULT 1 
T23236 

hypothetical protein K02C4.3 - Caenorhabditis elegans 
C;Species: Caenorhabditis elegans 

C;Date: 15-Oct~1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C;Accession: T23236 
R ; Lightning , J . 

submitted to the EMBL Data Library, January 1995 
A; Reference number: Z19713 
A;Accession: T23236 

A; Status: preliminary; translated from GB/EMBL/DDBJ 



A; Molecule type: DNA 
A;Residues: 1-1302 <WIL> 

A;Cross-referenceS: EMBL: Z47811 ; PIDN: CAA87786 . 1 ; GSPDB:GN00020 ; CESP:K02C4.3 

A; Experimental source: clone K02C4 

C; Genetics : 

A; Gene: CESP:K02C4.3 

A; Map position: 2 

A;Introns: 10/1; 31/3; 84/3; 279/3; 464/1; 745/1; 957/2; 978/3; 1090/2; 1201/3; 1270/3 



Query Match 8.7%; Score 4 85.5; DB 2; Length 1302; 

Best Local Similarity 23.2%; Pred. No. 1.2e-18; 

Matches 237; Conservative 139; Mismatches 353; Indels 293; Gaps 45; 

Qy 90 NVIDLTGDDKDDLQRTIALSLAESNRAFRETGITDEEQAISRVLEASIAENKACLKRTPT 149 

h :: -I I 111 I I III II 
Db 150 NISEI SKKEEVEMQSAIQQSLASS ASQNISR PT 182 

Qy 150 EVWRDSRNPYDRKRQDKAPVGLKNVGNTCWFSAVIQSLFNLLEFRRLVLNYKPPSNAQDL 209 

III III Mill : : I 1=:: Ml : II = 

Db 183 ML MSNLEDMVRNPNFSTGLYNSGNTCWLNCLSQVLYS I PKFRS I LYHCAPLSWHEQP 239 

Qy 210 PRNQK EHRNLPFMRELRYLFALLVGTKRKYVDPSRAVEILKDAFKSND SQQ 260 

II :| I : I III I - II- : = = 11= I 

Db 24 0 I TNVK I ENQQHAE LLML - - FRGL F AE LQ F S EMKYI E VG PL INMVD KLS KS S KG P S T I GTQ 297 

Qy 261 QDVSEFTHKLLDWLEDAF QMKAE - - EETDEEK 290 

II =1 = III: I I |= I Mil 

Db 298 QDAT EMLT L I FDWLQRA FD AALHAQ LN P E F SNVS D E ENLV I SD STTT APNS D 1 1 G A P PG Y 357 

Qy 2 91 PK NPM VELFYGRFLAVGV L 309 

I I III II I : I I 

Db 358 NAANLSLPSSSHVDPKSTLNPMYVNEKEPSSTPTSLFGTRSKTIEVNESMDTEAATSSNL 417 

Qy 310 EGKKFEN TEMFGQ 322 

I II II : 

Db 418 PGNSVENHPNPAAPEVDDNKKAFCDKLKESFNNIFSSVCYTESVAEDGTVSVKSNVRNCP 477 

Qy 323 --YPLQVNGFKDLHECLEAAMIEGEIESLHSENSGKSGQEHWFTGLPPVLTFXLSRFEFN 380 

: III : ! I I I I : I I : : II h I Mil II 

Db 478 QFFQLQVT - YGNLHDALEAATFD HGLGNTASHVRNLYDPLPAVIFFGLSRFSFN 530 

Qy 381 QALGRPEKIHNKLEFPQVLYLDRYMHRNREITRIKREEIKRLKDYLTVLQQRLERYLSYG 440 

|:|:| ||:::::|||: |:| | : :| |: :: :| | 

Db 531 SNI - - ESKLHDKFTFPKI I FMDRYLKCNKEQLVQLRSHRELCRDSLSEVRAKLSGLRRYP 588 

Qy 441 SGPKRFPLVDVLQYALEFASS-KPVCTSPVDDIDASSPPSGSIPSQTLPS TTEQQGA 4 96 

I | | | : |: : | : = :| | | 

Db 589 QGNGEVRLEDSFQTVWQAVSNFREFVTFYL KVSQKTFFSREDAHENTAF 637 

Qy 497 LSSELPSTSPSSVAAISSRSV IHKPFTQSRIPPDLPMHPAPRHITEEELSVLESC 551 

III II Ih I « I I * I « l 

Db 638 VGPLTPSTYQSSSDNCSSKFVKDGGKLFPTFTEGFFPGKAAF IETLQNM 686 

Qy 552 LHRWRTEI ENDTRDLQES I SRIHRTI ELMYSDKSMIQVPYRLHAVLVHEGQANAGHYWAY 611 

I Ml II :|: h I M I MMMI IMI MM I 

Db 687 LEALKTE ERDCLAEEARLQEVIDQTYEVPELQQHKYELHAIIVHSGEANRGHYWTY 742 

Qy 612 1 FDHRE S RWMKYND I AVT KS S WE E L VRD S FG - G YRNA - SA YCLM Y I ND KAQ FL I QE 665 

I I I I M : I :: MM I IM III MM MM: 

Db 743 KLKKSIDGLE-EWEKLNDQNADRVDWPKVESDSFGTGSRDAPSAYMLMYVRSDAEWLVSA 801 

Qy 666 EFNKETGQPLVGIETLPPDLRDFVEEDNQRFEKELEEWDAQLAQKALQEKLLASQKLRES 72 5 

'II I Ihlllh: I = h-l h : I h 

Db 802 D - -KLTA- -LEAFETIPPDLQEKVLQKRDEFKEKL QRFRENKEFNYQQFSVD 849 

Qy 726 ETSVTTAQAAGDPEYLEQPSRSDFSKHLKE-ETIQIITKASHEHEDKSPETVLQSAIKLE 784 

I I I I II : I I I I I : : I | = 

Db 850 SPTVQSTE ETP- -SSFSWYRDELEDIDI GDENANP TKND 886 



Qy 



785 YARLVKLAQEDTP- - PETD YRLHH WVYF I QNQ APKKIIE KTLLE - - 



827 



Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 



887 YLLNARLDSYSVPIAPDVETSEMKRMVSQMWNQITKIAPRKYTDSQDLLDSNLRSVMEGE 94 6 

828 QFGDRNLSFDERCHNIMKVAQAKLEMIKPEEVNLEEYEEWHQDYRKFRETTMYLII 883 

I : I :| I : I :| : :| || :| : : : « . 

947 SGGINFINSRLGYD- - IHELRSDADNDVEGVYNAFIN- -EYLGLVKDLHELQNSKFWFV 1002 

884 GLENFQRESYIDSLLFLICAYQNNKEL LSKGLYRGHDEE LIS 925 

I : II :: I :|: II II Ih 1=1 

1003 GFQ - LQR - IHVPVLRYLLVRAMAVSELGI I SQRANNELSGMS SNSHDKGTAMLQI ALLLS 1060 

926 HY 927 
h 

1061 HF 1062 



Result 
No. 



Query 

Score Match Length DB 



ID 



Description 



1 


5410 


96 


9 


1087 


1 


UBPP_HUMAN 


Q9uhp3 


homo sapien 


2 


5091 


91 


2 


1055 


1 


UBPP_MOUSE 


P57080 


mus musculu 


3 


485.5 


8 


7 


1302 


1 


UBPY_CAEEL 


Q09931 


caenorhabdi 


4 


323.5 


5 


8 


1272 


1 


UBP2_YEAST 


Q01476 


sac char omyc 


5 


316 . 5 


5 


7 


1108 


1 


UBPB_SCHPO 


Q09879 


schizosacch 


6 


310 


5 


6 


1221 


1 


UBP2_KLULA 


042726 


kluyveromyc 


7 


287.5 


5 


2 


898 


1 


UBPE_DROME 


Q24574 


drosophila 


8 


276 


4 


9 


1102 


1 


UBP7__HUMAN 


Q93009 


homo sapien 


9 


253.5 


4 


5 


1230 


1 


UBPF_YEAST 


P50101 


sac char omyc 


10 


226 


4 


0 


499 


1 


UBPH_YEAST 


P43593 


sac char omyc 


11 


224 


4 


0 


2547 


1 


FAFX_HUMAN 


Q93008 


h probable 



RESULT 1 
UBPP_HUMAN 

ID UBPP_HUMAN STANDARD; PRT; 1087 AA. 

AC Q9UHP3 ; 

DT 30-MAY-2000 (Rel . 39, Created) 

DT 01-OCT-2000 {Rel. 40, Last sequence update) 

DT 01-OCT-2000 {Rel. 40, Last annotation update) 

DE UBIQUITIN CARBOXYL- TERMINAL HYDROLASE 25 (EC 3.1.2.15) (UBIQUITIN 

DE THIOLESTERASE 25) (UBIQUITIN-SPECIFIC PROCESSING PROTEASE 25} 

DE {DEUBIQUITINATING ENZYME 25) (USP ON CHROMOSOME 21) . 

GN USP2 5 OR USP21 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI__TaxID=9606 ; 

RN [1] 

RP SEQUENCE FROM N.A. (SHORT AND LONG ISOFORMS) . 

RC TISSUE=Fetal brain; 

RX MEDLINE=20112753; PubMed=10644437 ; 

RA Valero R., Marfany G., Gonzalez -Angulo 0., Gonzalez -Gonzalez G., 

RA Puelles L., Gonzalez -Duarte R. ; 

RT " USP25 , a novel gene encoding a deubiquitinating enzyme, is located in 

RT the gene-poor region 21qll.2." ; 

RL Genomics 62:395-405(1999). 

RN [2] 

RP SEQUENCE FROM N.A. (SHORT ISOFORM) . 

RX MEDLINE=20080730; PubMed=10612 803 ; 

RA Groet J., Ives J.H., Jones T.A., Danton M. , Flomen R.H., Sheer D., 

RA Hrascan R., Pavelic K., Nizetic D.; 

RT "Narrowing of the region of allelic loss in 21qll-21 in squamous non- 

RT small cell lung carcinoma and cloning of a novel ubiquitin- specific 

RT protease gene from the deleted segment . " ; 

RL Genes Chromosomes Cancer 27:153-161(2000). 

CC -!- FUNCTION: INVOLVED IN THE UBIQUITIN-DEPENDENT PROTEOLYTIC PATHWAY 
CC IN CONJUNCTION WITH THE 26S PROTEASOME (BY SIMILARITY) . 

CC -!- CATALYTIC ACTIVITY: UBIQUITIN C- TERMINAL THIOLESTER + H(2)0 = 
CC UBIQUITIN + A THIOL. 

CC ALTERNATIVE PRODUCTS: 2 ISOFORMS; A LONG FORM (SHOWN HERE) AND A 

CC SHORT FORM; ARE PRODUCED BY ALTERNATIVE SPLICING. 



CC -!- TISSUE SPECIFICITY: FOUND IN MOST ADULT AND FETAL TISSUES; 

CC EXPRESSION IS MODERATELY HIGH IN TESTIS, PANCREAS , KIDNEY, 

CC SKELETAL MUSCLE, LIVER, LUNG, PLACENTA, BRAIN, HEART, BUT VERY LOW 

CC IN PERIPHERAL BLOOD, COLON, SMALL INTESTINE, OVARY, PROSTATE, 

CC THYMUS AND SPLEEN. 

CC -!- SIMILARITY: BELONGS TO PEPTIDASE FAMILY C19; ALSO KNOWN AS 
CC FAMILY 2 OF UBIQUITIN CARBOXYL- TERMINAL HYDROLASES. 

CC ~~~ 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL out station - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement {See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

cc 

DR EMBL; AF170562; AAF32263.1; -- 

DR EMBL; AF134213; AAF24998.1; -. 

DR MIM; 604736; -. 

DR InterPro; IPR001394; -. 

DR Pfam; PF00442; UCH-1; 1. 

DR Pfam; PF00443; UCH-2; 1. 

DR PROSITE; PS00972; UCH_2_1 ; 1. 

DR PROSITE; PS00973; UCH_2_2 ; 1. 

DR PROSITE; PS50235; UCH_2_3 ; 1. 

KW Ubiquitin conjugation; Hydrolase; Thiol protease; Multigene family; 

KW Alternative splicing. 



FT 


ACT_SITE 


178 


178 


BY SIMILARITY . 


FT 


ACT_SITE 


599 


599 


BY SIMILARITY . 


FT 


ACT_SITE 


607 


607 


BY SIMILARITY. 


FT 


VARSPLIC 


780 


811 


MISSING (IN SHORT ISOFORM) 


FT 


CONFLICT 


544 


544 


K - > E (IN REF. 2) . 


SQ 


SEQUENCE 


1087 


AA; 125748 


MW; 0699DDB7451B2C6C CRC64 



Query Match 96.9%; Score 5410; DB 1; Length 1087; 

Best Local Similarity 94.9%; Pred. No. 1.5e-283; 

1051; Conservative 1; Mismatches 3; Indels 52; Gaps 

MTVEQNVLQQS AAQKHQQTFLNQLRE I TG I NDTQ I LQQALKDSNGNLELAVAFLTAKNAK 60 

IIIIIIIMIIIMIIIIIMIIIIIII IIIIIIIIIIIIIIIIIIIIMMIII 

MTVEQNVLQQSAAQKHQQTFLNQLRE I TGINDTQI LQQALKDSNGNLELAVAFLTAKNAK 6 0 
TPQQEETTYYQTALPGNDRYISVGSQADTNVIDLTGDDKDDLQRTIALSLAESNRAFRET 12 0 

iiiiiiiiiiiiiimiiiimiiiiiiMiiiiiiiiiiii Miiiiiiiitiiii 

TPQQEETTYYQTALPGNDRYISVGSQADTNVIDLTGDDKDDLQRAIALSLAESNRAFRET 120 
G I TDE EQ A I S R VLE AS I AENKAC L KRT PT E VWRD S RN P YDRKRQD KAP VGLKNVGNTCW F 180 

IIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIMIMIIIIMMIIIIIII 

GITDEEQAISRVLEASIAENKACLKRTPTEVWRDSRNPYDRKRQDKAPVGLKNVGNTCWF 180 



Matches 


Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 



llllllll M ! ! ! Mil llltlllll llllllllll 1 1 I Mlllllll II II II II 

SAVIQSLFNLLEFRRLVLNYKPPSNAQDLPRNQKEHRNLPFMRELRYLFALLVGTKRKYV 



240 



DPSRAVEILKDAFKSNDSQQQDVSEFTHKLLDWLEDAFQMKAEEETDEEKPKNPMVELFY 300 

1 1 1 1 1: ,: 1 1 h:l II M i! II !i II 1 1 il :i 1 1 1 M I M 1 1 , 1 1 ! lillll M 1 1 1; r 

DPSRAVEILKDAFKSNDSQQQDVSEFTHKLLDWLEDAFQMKAEEETDEEKPKNPMVELFY 300 
GRFLAVGVLEGKKFENTEMFGQYPLQVNGFKDLHECLEAAMIEGEIESLHSENSGKSGQE 360 

II III ; 1 , 1 1 MM II II II II II II II II M I M 1 1 lillll lllllllll I MM 

GRFLAVGVLEGKKFENTEMFGQYPLQVNGFKDLHECLEAAMIEGEIESLHSENSGKSGQE 360 
HWFTGLPPVLTFXLSRFEFNQALGRPEKIHNKLEFPQVLYLDRYMHRNREITRIKREEIK 420 

MM MIMM MMMMMMMMMMMMMMMMMMMMMMIM 

HWFTELPPVLTFELSRFEFNQALGRPEKIHNKLEFPQVLYLDRYMHRNREITRIKREEIK 420 
RLKDYLTVLQQRLERYLSYGSGPKRFPLVDVLQYALEFASSKPVCTSPVDDIDASSPPSG 480 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMIMI 

RLKD YLTVLQQRLER Y L S YG S G P KRF PL VD VLQ YAL EFASSKPVCTS P VDD I DAS S P P SG 4 80 



Qy 
Db 



481 SIPSQTLPSTTEQQGALSSELPSTSPSSVAAISSRSVIHKPFTQSRIPPDLPMHPAPRHI 540 

IIIMIIIIMIII11I1MIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 

481 SIPSQTLPSTTEQQGALSSELPSTSPSSVAAISSRSVIHKPFTQSRIPPDLPMHPAPRHI 540 



Qy 


541 


TEEELSVLESCLHRWRTEIENDTRDLQESISRIHRTIELMYSDKSMIQVPYRLHAVLVHE 

1 i hi !! Mill III MM llll llllll 1 1 1 1 llllll IMIIMII llllllll ! 1 1 1 


600 


Db 


541 


TEEKLSVLESCLHRWRTEIENDTRDLQESISRIHRTIELMYSDKSMIQVPYRLHAVLVHE 


600 


Qy 


601 


GQANAGHYWAYI FDHRESRWMKYNDI AVTKSSWEELVRDS FGGYRNASAYCLMYINDKAQ 
1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 t 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


660 


Db 


601 


1 M 1 1 II M M 1 II 1 1 1 1 1 II M M M 1 M II 1 M M M II I M 1 II II II M M M M 1 

GQANAGHYWAYI FDHRESRWMKYND I AVTKS SWEELVRDS FGGYRNASAYCLMYINDKAQ 


660 


Qy 


661 


FLIQEEFNKETGQPLVGIETLPPDLRDFVEEDNQRFEKELEEWDAQLAQKALQEKLLASQ 

III II M M M li II llll II II llllll M 1 1 M II II II II IMM IM M 1 M 1 M 

FLIQEEFNKETGQPLVGIETLPPDLRDFVEEDNQRFEKELEEWDAQLAQKALQEKLLASQ 


720 


Db 


661 


720 


Qy 


721 


KLRESETSVTTAQAAGDPEYLEQPSRSDFSKHLKEETIQIITKASHEHEDKSPETVLQS- 

MM M 1 1 1 i 1 ; 1 II MINI M II MM 1 1 MM : M | MUM 1 llllll IM 1 

KLRESETSVTTAQAAGDPEYLEQPSRSDFSKHLKEETIQIITKASHEHEDKSPETVLQSI 


779 


Db 


721 


780 


Qy 


780 


AI KLE YARLVKLAQEDT P PETD YRLHH W 


808 


Db 


781 


IMMIIIMIIMMIl!, IMMMM 

MMTPNMQGI I MA I G KS R S VYDRCG P E AG F F KA I KLE YARLVKLAQEDT P P ETD YR LHH W 


840 


Qy 


809 


VYFIQNQAPKKIIEKTLLEQFGDRNLSFDERCHNIMKVAQAKLEMIKPEEVNLEEYEEWH 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 M 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 M 1 1 M I i 1 1 1 1 1 1 1 

VYFIQNQAPKKIIEKTLLEQFGDRNLSFDERCHNIMKVAQAKLEMIKPEEVNLEEYEEWH 


868 


Db 


841 


900 


Qy 


869 


QDYRKFRETTMYLIIGLENFQRESYIDSLLFLICAYQNNKELLSKGLYRGHDEELISHYR 

IMMMM M II IIIIIMI II M MMM MM MM II IMIIMI IMIMIIMI 

QDYRKFRETTMYLIIGLENFQRESYIDSLLFLICAYQNNKELLSKGLYRGHDEELISHYR 


928 


Db 


901 


960 


Qy 


929 


RECLLILNLKRKQKP I LFFFLHCIKKLNEQAAELFESGEDREVNNGLI IMNEFIVPFLPL 

Mill II 1 1 1 1 i i 1 1 ; 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 

RECLL KLNEQAAELFESGEDREVNNGL I IMNEFIVPFLPL 


988 


Db 


961 


1000 


Qy 


989 


LLVDEMEEKDILAVEDMRNRWCSYLGQEMEPHLQEKLTDFLPKLLDCSMEIKSFHEPPKL 

1 1 1 1 1 1 1 1 M M M 1 1 M 1 1 1 1 M i 1 1 1 1 1 1 i f 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

LLVDEMEEKDI LAVEDMRNRWCS YLGQEMEPHLQEKLTDFLPKLLDCSME I KS FHEPPKL 


1048 


Db 


1001 


1060 


Qy 


1049 


PSYSTHELCERFARIMLSLSRTPADGR 1075 

llll Mill M 1 MIMIMI II II M 

PSYSTHELCERFARIMLSLSRTPADGR 108 7 




Db 


1061 





RESULT 2 
UBPPJVIOUSE 

ID UBPP_MOUSE STANDARD; PRT; 1055 AA. 

AC P57080; 

DT 01-OCT-2000 (Rel. 40, Created) 

DT 01-OCT-2000 {Rel. 40, Last sequence update) 

DT 01-OCT-2000 (Rel. 40, Last annotation update) 

DE UBIQUITIN CARBOXYL- TERMINAL HYDROLASE 25 (EC 3.1.2.15) (UBIQUITIN 

DE THIOLESTERASE 25) (UBIQUITIN- SPECIFIC PROCESSING PROTEASE 25) 

DE (DEUBIQUITINATING ENZYME 25) (MUSP25) . 

GN USP25. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Kidney; 

RX MEDLINE=20112753; PubMed=10644437 ; 

RA Valero R . , Marfany G., Gonzalez -Angulo O., Gonzalez -Gonzalez G., 

RA Puelles L., Gonzalez -Duarte R. ; 

RT "USP25 , a novel gene encoding a deubiquitinating enzyme, is located in 

RT the gene-poor region 21qll.2." ; 

RL Genomics 62:395-405(1999). 

CC FUNCTION: INVOLVED IN THE UBIQUITIN- DEPENDENT PROTEOLYTIC PATHWAY 

CC IN CONJUNCTION WITH THE 26 S PROTEASOME (BY SIMILARITY) . 



CC -!- CATALYTIC ACTIVITY: UBIQUITIN C- TERMINAL THIOLESTER + H(2)0 = 
CC UBIQUITIN + A THIOL. 

CC -!- TISSUE SPECIFICITY: HIGHLY EXPRESSED IN TESTIS . ALSO FOUND IN 
CC BRAIN, SKELETAL MUSCLE, LIVER AND HEART. 

CC -!- SIMILARITY: BELONGS TO PEPTIDASE FAMILY C19; ALSO KNOWN AS 
CC FAMILY 2 OF UBIQUITIN CARBOXYL- TERMINAL HYDROLASES. 

cc 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; AF170563; AAF32264.1; -. 

DR MGD; MGI : 1353655 ; Usp25 . 

DR PROSITE; PS00972; UCH_2_1 ; 1. 

DR PROSITE; PS00973; UCH_2_2 ; 1. 

DR PROSITE; PS50235; UCH_2_3 ; 1. 

KW Ubiquitin conjugation; Hydrolase; Thiol protease; Multigene family. 

FT ACT_SITE 178 178 BY SIMILARITY. 

FT ACT_SITE 600 600 BY SIMILARITY. 

FT ACT_SITE 608 608 BY SIMILARITY. 

SQ SEQUENCE 1055 AA; 121426 MW; CDFDFD1052745D70 CRC64 ; 



Query Match 91.2%; Score 5091; DB 1; Length 1055; 

Best Local Similarity 91.5%; Pred. No. 2.1e-266; 

Matches 985; Conservative 29; Mismatches 40; Indels 22; Gaps 3; 

Qy 1 MTVEQNVLQQSAAQKHQQTFLNQLREITGINDTQILQQALKDSNGNLELAVAFLTAKNAK 60 

I I I 1 I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I I I I I I 

Db 1 MTVEQNVLQQSAAQKHQQTFLNQLREITGINDAQILQQALKDSNGNLELAVAFLTAKNAK 60 

Qy 61 TPQQEETTYYQTALPGNDRYISVGSQADTNVIDLTGDDKDDLQRTIALSLAESNRAFRET 120 

II 1 1 1 1 I ;! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lllllllllllllll IIIIMIIIIIIIII 

Db 61 TPPQEETGYYQTALPGNDRYISVGSQADANVIDLTGDDKDDLQRAIALSLAESNRAFRET 120 

Qy 121 GITDEEQAISRVLEASIAENKACLKRTPTEVWRDSRNPYDRKRQDKAPVGLKNVGNTCWF 180 

IIIIIIIMIIMIIIIIIMIIIIIII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 II Mil I 

Db 121 G I TDE EQAI S RVL E AS I AENKAC LKRT P I E VWRD S RN P YD RKRQ E KA P VGLKNVGNT CW F 180 

Qy 181 SAVIQSLFNLLEFRRLVLNYKPPSNAQDLPRNQKEHRNLPFMRELRYLFALLVGTKRKYV 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 181 SAVIQSLFNLLEFRRLVLNYKPPSNAQDLPRNQKEHRNLPFMRELRYLFALLVGTKRKYV 240 

Qy 241 DPSRAVEILKDAFKSNDSQQQDVSEFTHKLLDWLEDAFQMKAEEETDEEKPKNPMVELFY 300 

II II 1 1 1 1: INI I Mill MM II II I! 1 1 1 1 ! I II II I! Illlll I II 1 1 1! MM II 

Db 241 DPSRAVEILKDAFKSNDSQQQDVSEFTHKLLDWLEDAFQMKAEEETDEEKPKNPMVELFY 300 
Qy 301 GRFLAVGVLEGKKFENTEMFGQYPLQVNGFKDLHECLEAAMIEGEIESLHSENSGKSGQE 360 

1 1 1 1 h 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II I II 1 1 

Db 301 GRFLAMGVLEGKKFENTEMFGQYPLQVNGFKDLHECLEAAMIEGEIESLHSDNSGKSGQE 360 

Qy 361 HWFTGLPPVLTFXLSRFEFNQALGRPEKIHNKLEFPQVLYLDRYMHRNREITRIKREEIK 420 

I I I I I I I I I I I M I I I I I I I I I II I I I I I I h I I I I I II I I I I I I I I I II I II I I I I I 
Db 361 HWFTELPPVLTFELSRFEFNQALGRPEKIHNKLKFPQVLYLDRYMHRNREITRIKREEIK 420 

Qy 421 RLKDYLTVLQQRLERYLSYGSGPKRFPLVDVLQYALEFASSKPVCTSPVDDIDASSPPSG 480 

1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 i t e 1 1 1 1 1 1 1 e 1 1 1 1 i i r 1 1 1 1 1 1 1 1 1 1 1 [ i e 1 1 e 1 1 i 1 1 ii 

Db 421 RLKDYLTVLQQRLERYLSYGSGPKRFPLVDVLQYALEFASSKPVCTSPVDDIDASSSASG 4 80 
Qy 4 81 SIPSQTLPSTTEQQGALSSELP-STSPSSVAAISSRSVIHKPFTQSRIPPDLPMHPAPRH 539 

M II M I M 1 1 1 1 1 MMI MIMI IM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 481 PLPSQSLPSTTEQQGLCASDLPGSSSPASGAALPLRSVIHKPFTQSRIPPDLPMHPAPRH 540 



Qy 

Db 



540 
541 



ITEEELSVLESCLHRWRTEIENDTRDLQESISRIHRTIELMYSDKSMIQVPYRLHAVLVH 

Illlll IMIIIIIMIIIIIIIIIMIIIIIMIMIMMIIMMIIMIIIIIII 

ITEEELCVLESCLHRWRTEIENDTRDLQESISRIHRTIELMYSDKSMIQVPYRLHAVLVH 



599 
600 



Qy 600 EGQANAGHYWAYIFDHRESRWMKYNDIAVTKSSWEELVRDSFGGYRNASAYCLMYINDKA 659 

II MMIIIMI II II Ml 1 1 1 1 M MIMI IMMIII I M II M 1 1 1 Ml MM MM 

Db 601 EGQANAGHYWAYIFDHRESRWMKYNDIAVTKSSWEELVRDSFGGYRNASAYCLMYIDDKA 660 

Qy 660 QFLIQEEFNKETGQPLVGIETLPPDLRDFVEEDNQRFEKELEEWDAQLAQKALQEKLLAS 719 

llllllllllllll llhlll llllllllllllllllllllll 
Db 661 QFLIQEEFNKETGQALVGMETLLPDLRDFVEEDNQRFEKELEEWDTQIAQRSLQEKLLAA 720 



Qy 


720 


QKLRESETSVTTAQAAGDPEYLEQPSRSDFSKHLKEETIQIITKASHEHEDKSPETVLQS 779 
1 1 1 1 . 1 1 1 1 1 1 1 1 . 1 1 1 1 1 1 II 1 III 1 1 1 1 ... 1 Mil. Ill 1 1 1 II 1 1 


Db 


721 


1 M I • 1 1 Mill 1 : 1 1 1 1 M 1 M 1 M lll ::: l 1111= Ml 1 1 II 1 1 1 

PKLREAEASATTAQAGG-ADYLEQPSRSDLSKHWKEETLRVIAKASHDLEDKGPETVLQS 779 


Qy 


780 


AIKLEYARLVKLAQEDTPPETDYRLHHVWYFIQNQAPKKI IEKTLLEQFGDRNLSFDER 83 9 

1 1 1 1 1 1 = 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 M 1 ! 1 H 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 

AI KLEYSRLVKLAQEDTPPETDYRLHHVLVYFI QNQAPKKI I EKTLLEQFGDRNLS FDER 839 


Db 


780 


Qy 


840 


CHNIMKVAQAKLEMIKPEEVNLEEYEEWHQDYRKFRETTMYLIIGLENFQRESYIDSLLF 899 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMMMMI MMMMMMMM 

CLNIMKVAQAKLEMIKPEEVNLEEYEEWHADYKKFRETTMYLITGLENFQRESYIDSLLF 899 


Db 


840 


Qy 


900 


LICAYQNNKELLSKGLYRGHDEELISHYRRECLLILNLKRKQKPILFFFLHCIKKLNEQA 959 


Db 


900 


h 1 1 1 f 1 1 1 1 1 1 1 1 1 MMI MMMMMM MMM 

LLCAYQNNKELLSKGPYRGHDGELISHYRRECLL KLNEQA 939 


Qy 


960 


AELFESGEDREVNNGLIIMNEFIVPFLPLLLVDEMEEKDILAVEDMRNRWCSYLGQEMEP 1019 


Db 


940 


II IIIIIM M II II 1 II II II 1 1 1 1 M II II M M 1 II 1 II M II II M 1 II 1 1 II 1 

AELFESGEDGDVNNGLIIMNEFIVPFLPLLLVDDMEEKDILAVEDMRNRWCSYLGQEMEA 999 


Qy 


1020 


HLQEKLTDFLPKLLDCSMEIKSFHEPPKLPSYSTHELCERFARIMLSLSRTPADGR 1075 

: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M III IIIMIIMM MIIIIMIIIIIIIIIIIIM 

NLQEKLTDFLPKLLDCSTEIKGFHEPPKLPSYSAHELCERFARIMLSLSRTPADGR 1055 


Db 


1000 



Result Query 



STo. 


Score 


Match Length DB 


ID 


Description 


1 


3816 


68.4 


737 


4 


Q9H9W1 


Q9h9wl homo sapien 


2 


2721 


48.8 


523 


4 


Q9HA22 


Q9ha22 homo sapien 


3 


1875 


33.6 


757 


4 


Q9P213 


Q9p213 homo sapien 


4 


303 


5.4 


1124 


10 


Q9SF08 


Q9sf08 arabidopsis 


5 


297.5 


5.3 


1116 


10 


Q9FPT1 


Q9fptl arabidopsis 


6 


297 


5.3 


1126 


10 


Q9FG10 


Q9fgl0 arabidopsis 


7 


296 


5.3 


1248 


5 


Q9VRP1 


Q9vrpl drosophila 


8 


288.5 


5.2 


599 


10 


Q9FU99 


Q9fu99 arabidopsis 


9 


287 


5.1 


1130 


5 


045624 


045624 caenorhabdi 


10 


287 


5.1 


1133 


5 


045623 


04 5623 caenorhabdi 


11 


283.5 


5.1 


1141 


3 


Q9P3U0 


Q9p3u0 schizosacch 



RESULT 1 
Q9H9W1 

ID Q9H9W1 PRELIMINARY; PRT; 737 AA. 

AC Q9H9W1 ; 

DT 01-MAR-2001 (TrEMBLrel. 16, Created) 

DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT 01-MAR-2001 (TrEMBLrel. 16, Last annotation update) 

DE CDNA FLJ12512 FIS, CLONE NT2RM2001730 , WEAKLY SIMILAR TO PROBABLE 

DE UBIQUITIN CARBOXYL- TERMINAL HYDROLASE K02C4.3 (EC 3.1.2.15). 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606 ; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Isogai T., Ota T., Hayashi K. , Sugiyama T., Otsuki T., Suzuki Y., 

RA Nishikawa T . , Nagai K . , Sugano S., Aotsuka S., Yoshikawa Y. , 

RA Matsunawa H., Ishii S., Kawai Y., Saito K. , Yamamoto J., Wakamatsu A., 

RA Nakamura Y., Nagahari K. , Masuho Y. , Sasaki N. ; 

RT "NEDO human cDNA sequencing project."; 

RL Submitted (AUG-2000) to the EMBL /GenBank/DDB J databases. 



DR EMBL; AK022574 ; BAB14107.1; -. 

SQ SEQUENCE 737 AA; 85842 MW; D660BB2A18077E10 CRC64; 



Query Match 68.4%; Score 3816; DB 4; Length 737; 

Best Local Similarity 97.1%; Pred. No. 6.4e-221; 

735; Conservative 0; Mismatches 2; Indels 20; Gaps 

MFGQYPLQVNGFKDLHECLEAAMIEGEIESLHSENSGKSGQEHWFTGLPPVLTFXLSRFE 378 

IIIIIIIIIMIIIIIIIIIII1IIIIIIIIIIIIIIIIIIIIIII Mlllll Mill 

MFGQYPLQVNGFKDLHECLEAAMIEGEIESLHSENSGKSGQEHWFTELPPVLTFELSRFE 60 
FNQALGRPEKIHNKLEFPQVLYLDRYMHRNREITRIKREEIKRLKDYLTVLQQRLERYLS 438 

1 1 1 1 1 1 1 1 i 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 i 1 1 1 

FNQALGRPEKIHNKLEFPQVLYLDRYMHRNREITRIKREEIKRLKDYLTVLQQRLERYLS 12 0 
YGSGPKRF PLVDVLQYALE FAS S KPVCTS P VDD I DAS S P P SG S I PSQTLP STTEQQGALS 4 98 

1 1 1 1 1 M 1 1 1 M i 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 i 1 1 i 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 It t 

YGSGPKRFPLVDVLQYALEFASSKPVCTSPVDDIDASSPPSGSIPSQTLPSTTEQQGALS 180 
SELPSTSPSSVAAISSRSVIHKPFTQSRIPPDLPMHPAPRHITEEELSVLESCLHRWRTE 558 

MMI II I M 1 1 IN 1 1 1 ! 1 1 1 II 1 1 1 M M 1 1! 1 1 II 1 1 II M 1 1 1 M I i I i 1 1 1 1 1 1 1 

SELPSTSPSSVAAISSRSVIHKPFTQSRIPPDLPMHPAPRHITEEELSVLESCLHRWRTE 240 
I ENDTRDLQE S I SR I HRT I ELMYSDKSM I QVP YRLHAVLVHEGQANAGHYWAYI FDHRE S 618 

I 1 1 1 1 M I II I hi 1 1 h I! 1 1 II 1 1 1 1 1 1 1 M II II III! 1 1 - i ' M 1 1 i : I II II 

IENDTRDLQESISRIHRTIELMYSDKSMIQVPYRLHAVLVHEGQANAGHYWAYIFDHRES 300 
RWMKYNDIAVTKSSWEELVRDSFGGYRNASAYCLMYINDKAQFLIQEEFNKETGQPLVGI 678 

I E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 



ET L P PD LRD F VE EDNQR FE KELEE WDAQLAQKALQE KLLAS QKLRE S ETS VTTAQAAGD P 738 

1 1 1 1 1 1 1 1 1 1 i I M 1 1 1 F I E 1 1 1 1 1 1 1 1 1 1 1 i f 1 1 1 M I E 1 1 1 1 1 ( I f 1 1 1 1 1 M 1 1 1 1 1 

ETLPPDLRDFVEEDNQRFEKELEEWDAQLAQKALQEKLLASQKLRESETSVTTAQAAGDP 420 

EYLEQPSRSDFSKHLKEETIQIITKASHEHEDKSPETVLQSAIKLEYARLVKLAQEDTPP 798 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E I E 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E E 

EYLEQPSRSDFSKHLKEETIQIITKASHEHEDKSPETVLQSAIKLEYARLVKLAQEDTPP 480 



IIIIMIIIIMIMIMMIMMMIMIMIIIIIIMIMMMMIMM 



VNLEEYEEWHQDYRKFRETTMYLIIGLENFQRESYIDSLLFLICAYQNNKELLSKGLYRG 918 

1 11 1 1 1 1 1 1 II 1 1 1 1 1 II II I M I M II I M I II 1 1 1 1 1 1 M I M 1 1 1 II 1 1 1 II I II II 

VNLEEYEEWHQDYRKFRETTMYLIIGLENFQRESYIDSLLFLICAYQNNKELLSKGLYRG 600 



1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 I II II II M 1 1 1 1 M 1 1 1 1 1 1 III 

HDEELI SHYRRECLL KLNEQAAELFESGEDREVNNGLI I M 6 4 0 

NEFIVPFLPLLLVDEMEEKDILAVEDMRNRWCSYLGQEMEPHLQEKLTDFLPKLLDCSME 103 8 

MIIIIIMIIIIIIIIIIIIMMIIIIIMIIIIIIMMIIIIIIIIIMIIIIIII 

NEFIVPFLPLLLVDEMEEKDILAVEDMRNRWCSYLGQEMEPHLQEKLTDFLPKLLDCSME 70 0 
IKSFHEPPKLPSYSTHELCERFARIMLSLSRTPADGR 1075 

1 1 1 1 1 1 1 1 1 1 1 i i 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 i 1 1 1 1 E 1 1 1 

IKSFHEPPKLPSYSTHELCERFARIMLSLSRTPADGR 737 



Matches 


Qy 


319 


Db 


1 


Qy 


379 


Db 


61 


Qy 


439 


Db 


121 


Qy 


499 


Db 


181 


Qy 


559 


Db 


241 


Qy 


619 


Db 


301 


Qy 


679 


Db 


361 


Qy 


739 


Db 


421 


Qy 


799 


Db 


481 


Qy 


859 


Db 


541 


Qy 


919 


Db 


601 


Qy 


979 


Db 


641 


Qy 


1039 


Db 


701 



